Employing Emotion Cues to Verify Speakers in Emotional Talking Environments

نویسنده

Ismail Shahin

چکیده

Usually, people talk neutrally in environments where there are no abnormal talking conditions such as stress and emotion. Other emotional conditions that might affect people talking tone like happiness, anger, and sadness. Such emotions are directly affected by the patient health status. In neutral talking environments, speakers can be easily verified, however, in emotional talking environments, speakers cannot be easily verified as in neutral talking ones. Consequently, speaker verification systems do not perform well in emotional talking environments as they do in neutral talking environments. In this work, a two-stage approach has been employed and evaluated to improve speaker verification performance in emotional talking environments. This approach employs speaker emotion cues (text-independent and emotion-dependent speaker verification problem) based on both Hidden Markov Models (HMMs) and Suprasegmental Hidden Markov Models (SPHMMs) as classifiers. The approach is comprised of two cascaded stages that combines and integrates emotion recognizer and speaker recognizer into one recognizer. The architecture has been tested on two different and separate emotional speech databases: our collected database and Emotional Prosody Speech and Transcripts database. The results of this work show that the proposed approach gives promising results with a significant improvement over previous studies and other approaches such as emotion-independent speaker verification approach and emotion-dependent speaker verification approach based completely on HMMs.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Employing both gender and emotion cues to enhance speaker identification performance in emotional talking environments

Speaker recognition performance in emotional talking environments is not as high as it is in neutral talking environments. This work focuses on proposing, implementing, and evaluating a new approach to enhance the performance in emotional talking environments. The new proposed approach is based on identifying the unknown speaker using both his/her gender and emotion cues. Both Hidden Markov Mod...

متن کامل

Perception by Japanese, Korean and American listeners to a Korean speaker’s recollection of past emotional events: Some acoustic cues

Acoustic and perceptual analyses of spontaneous Korean were made of a Korean woman recalling past emotional events in her life. A subset of 20 single word utterances and 20 isolated vowels were presented to Japanese, American and Korean listeners who were asked to (1) rate the intensity of the perceived emotion and (2) identify the perceived emotion. Listeners could rate intensity and identify ...

متن کامل

Speaker identification investigation and analysis in unbiased and biased emotional talking environments

This work aims at investigating and analyzing speaker identification in each unbiased and biased emotional talking environments based on a classifier called Suprasegmental Hidden Markov Models (SPHMMs). The first talking environment is unbiased towards any emotion, while the second talking environment is biased towards different emotions. Each of these talking environments is made up of six dis...

متن کامل

The Effects of Culture and Gender on the Recognition of Emotional Speech: Evidence from Persian Speakers Living in a Collectivist Society

This paper reports on a behavioral study that explores the role of culture and gender in the recognition of emotional speech in an under investigated cultural context (a collectivist society: i.e., Iran). Participants were asked to recognize the emotional prosody of a set of validated emotional vocal portrayals (including the five basic emotions). Findings of the experiment were then comp...

متن کامل

Language, Emotion and Metapragmatics: A Theory Based on Typological Evidence

Humans are equipped with some universal or language-specific abilities to recognize emotions. However, because of the different emotional contents in diverse languages and the relevant cultural differences, humans with different cultural backgrounds own different metapragmatical abilities to recognize and express emotions. A hypothesis concerning emotional effects about intonation and particle ...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

J. Intelligent Systems

دوره 25 شماره

صفحات -

تاریخ انتشار 2016

Employing Emotion Cues to Verify Speakers in Emotional Talking Environments

نویسنده

چکیده

منابع مشابه

Employing both gender and emotion cues to enhance speaker identification performance in emotional talking environments

Perception by Japanese, Korean and American listeners to a Korean speaker’s recollection of past emotional events: Some acoustic cues

Speaker identification investigation and analysis in unbiased and biased emotional talking environments

The Effects of Culture and Gender on the Recognition of Emotional Speech: Evidence from Persian Speakers Living in a Collectivist Society

Language, Emotion and Metapragmatics: A Theory Based on Typological Evidence

عنوان ژورنال:

اشتراک گذاری